Using Information Retrieval Methods for a Comparison of Algorithms to find differentially expressed Genes in Microarray Data
Abstract
PUL is a novel algorithm for the identification of differentially expressed genes in two-group microarray experiments. PUL is compared to other popular algorithms using their published implementations. The comparison is based on established measures from information retrieval (recall and precision). Surprisingly, a clear ordering in the performance of the algorithms was observed. PUL outperformed the other algorithms by a factor of two. PUL was applied successfully in several practical applications. For these experiments, the importance of the genes proposed by PUL was independently verified.
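As a rough illustration of how recall and precision can be applied to a ranked gene list, the sketch below scores the top k entries of a ranking against a set of genes known to be differentially expressed. It is not the evaluation procedure of the paper: the function name `precision_recall_at_k`, the gene identifiers, and the cutoff values are all hypothetical and chosen only for this example.

```python
from typing import Sequence, Set, Tuple


def precision_recall_at_k(ranked_genes: Sequence[str],
                          true_genes: Set[str],
                          k: int) -> Tuple[float, float]:
    """Precision and recall of the top-k entries of a ranked gene list.

    ranked_genes: genes ordered from most to least likely differentially
                  expressed, as produced by some ranking algorithm.
    true_genes:   genes known to be differentially expressed, for example
                  from a benchmark data set with known properties.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    top_k = ranked_genes[:k]
    # A hit is a top-k gene that is truly differentially expressed.
    hits = sum(1 for gene in top_k if gene in true_genes)
    precision = hits / len(top_k) if top_k else 0.0
    recall = hits / len(true_genes) if true_genes else 0.0
    return precision, recall


# Hypothetical ranking and ground truth, for illustration only.
ranked = ["g7", "g2", "g9", "g1", "g5", "g3"]
truth = {"g2", "g7", "g3"}

for k in (2, 4, 6):
    p, r = precision_recall_at_k(ranked, truth, k)
    print(f"k={k}: precision={p:.2f}, recall={r:.2f}")
```

Computing both values over a range of cutoffs gives one precision/recall curve per algorithm, which is one way the rankings of different algorithms could be compared on the same benchmark.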
Related resources
A Comparison of Algorithms to Find Differentially Expressed Genes in Microarray Data
Several different algorithms have been published for the identification of differentially expressed genes in DNA microarray experiments. Such algorithms produce ordered lists of genes. To compare the performance of these algorithms, established measurements from Information Retrieval are proposed. A benchmark data set with known properties is generated and published. This benchmark data is used ...
Predicting CpG Islands and DNA Methylation in the Cow Genome Using DNA Microarray Meta-Analysis and Genome Wide Scanning
DNA methylation is a type of epigenetic change that directly affects DNA. In mammals, DNA methylation is essential for fetal development and stem cell differentiation, and it occurs mainly within CpG islands. In this study, two methods were used to study the DNA methylation profile of the cow genome. In the first method, the DNA methylation profile of the differentially expres...
The miR526b-5p-Related Single Nucleotide Polymorphisms, rs72618599, Located in 3'-UTR of TCF3 Gene, is Associated with the Risk of Breast and Gastric Cancers
Introduction: Single nucleotide polymorphisms result in dysregulation of the proto-oncogene TCF3 gene, which is associated with the development, metastasis, and chemoresistance of different malignancies. Methods: GSE10810 microarray dataset and GEPIA2 online software were used to find differentially expressed genes and the TCF3 status in breast cancer (BC) and gastric cancer (GC), respectively....
Extracellular exosomes and preeclampsia: a microarray-based study and functional enrichment analysis
Background: Preeclampsia (PE) is a heterogeneous pregnancy disease whose exact pathophysiology is unknown. Recently, exosomes have been indicated as a causative factor in the pathogenesis of PE. The aim of the study was to investigate microarray library data to extract the differentially expressed genes (DEGs) in PE and to perform a functional enrichment analysis to predict the rol...
Diagnosis of the disease using an ant colony gene selection method based on information gain ratio using fuzzy rough sets
With the advancement of metagenome data mining, research has become focused on microarrays. Microarrays are datasets with a large number of genes that are usually irrelevant to the output class; hence, the process of gene selection or feature selection is essential. Removing redundant genes in this way increases the speed and accuracy of classification. After applying the gene se...
Publication date: 2007